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Receiver and Low Power Digital Filter Therefor 

Technical Field 

5 This invention relates to digital communication technology, and in particular to a 

filter for correlating known coefficients with a data sequence. 
Background of the Invention 

In a spread spectrum communication system, the transmitted signal is spread over a 
frequency band that is significantly wider than the minimum bandwidth required to transmit 
10 the signal. By spreading transmission bandwidth across a broader bandwidth than minimally 
required, power levels at any given frequency within the bandwidth are significantly reduced. 
In one type of spread spectrum communication system, known as a direct sequence code 
^division multiple access (CDMA) modulation system, a radio frequency (RF) carrier is 
^modulated by a known digital code sequence referred to as a spreading code. The spreading 
15 1 1| code has a bit rate, or chipping rate, that is much higher than a clock rate of the underlying 
^information signal. The RF carrier may be binary or quadrature modulated by one or more 
uAdata streams. The data streams have one phase when the spreading code represents a data 
j =i"one" or "high" and a predetermined phase shift (e.g., 180° for binary modulation and 90° for 
^ ' quadrature modulation) when the spreading code represents a data "zero" or "low". These 
20 riiypes of modulation techniques are commonly referred to as binary shift key modulation 
: ;-J(BPSK) and quadrature shift key modulation (QPSK), respectively. 

To retrieve the data encoded in a transmitted signal upon being received as a received 
signal, CDMA systems despread received signal samples with the same spreading code as 
was used to encode the data. CDMA systems employ a filter for correlating the spreading 
25 code with received signal samples to determine the received signal samples in a multi-chip 
data sequence. 

A low power consuming analog filter for CDMA systems is disclosed in Low-Power 
Consuming Analog-Type Matched Filter for DS-CDMA Mobile Radio, by M. Sawahashi, F. 
Adachi, G. Shou, and C. Zhou, published in IEICE Transactions Fundamentals, Vol. E79-A, 
30 No. 12, Dec. 1996, pp2071-2077. 
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In one known matched filter, multi-bit received signal samples are clocked through a 
delay line comprised of a plurality of delay stages, such as a shift register. The delays in the 
shift register are of a multiple bit width sufficient to accommodate the received signal 
samples. Each of the delay stages provides a delay of less than one-half of the period of the 
5 spreading code clock, or chipping rate, to satisfy the Nyquist sampling theorem. The signal 
samples propagate down the delay line through a series of successive shifts at a rate 
corresponding to the chipping rate. Taps at each delay stage provide the delayed signal 
samples for multiplication by respective tap weights associated with the respective delay 
stages to produce product terms, which when summed, provide the filter output. The tap 
10 weights represent the spreading code used to encode information on the transmission signal 
prior to being transmitted. The filter output is a correlation of the received signal samples 
with the spreading code represented by the tap weights. 

Each time a new received signal sample is clocked into the delay line, each stage of 
•^the delay line is clocked to shift the previously received signal samples along the delay line 
1 5 illby, one delay stage. A received signal sample that has shifted completely through the delay 
yjine is shifted out of the delay line, as is known in the art. A shortcoming of this straight- 
l & 'forward implementation is the amount of power consumed by the delay line due to each stage 
r Jgof the delay line being clocked each time a new received signal sample is introduced into the 
•y delay line. 

20 w U.S. Patent No. 6,075,807 teaches a digital matched filter for CDMA systems that 
j ; include a digital delay line having a plurality of successive delay stages adapted to receive a 
digital signal and propagate the digital signal therethrough at a fixed rate. A correlator is 
coupled to the delay line to correlate the digital signal to a predefined spreading code to 
provide a correlation signal representing a degree of correlation of the digital signal to the 

25 spreading code. A window logic unit is coupled to the correlator to enable operation of the 
correlator only during successive discrete time periods of the correlation signal 
corresponding to a high degree of correlation of the digital signal to the spreading code. 
Summary of the Invention 

In accordance with the invention or a digital filter or a receiver including a digital 

30 filter includes at least two multiple stage shift registers. A plurality of multipliers 

corresponding in number to the number of stages in the at least two multiple stage shift 
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registers receive as a first input an output from a corresponding stage of the at least two 
multiple stage shift registers. A tap weight shifter is coupled to a tap weight source to 
receive tap weights. The tap weight shifter is coupled to provide a second input to each 
multiplier. Each multiplier produces an output that is the product of inputs thereto. An adder 
5 sums the multiplier outputs to provide a sum output. The tap weight shifter then circularly 
shifts the tap weights and another multiply-add operation occurs. Several shift/multiply/add 
cycles may occur before data is again shifted into the at least two multiple stage shift 
registers, and another multiply-add operation occurs. 

10 Brief Description of the Drawing 

Figure 1 is a simplified block diagram of a receiver including a digital filter for 
despreading a received signal in accordance with the present invention; 
't| Figure 2 is a simplified block diagram of a low power digital filter for despreading a 
■^received signal in accordance with the present invention that may be employed in the 
15 ^receiver of Figure 1; and 

X Figure 3 is a simplified block diagram of an alternate embodiment low power digital 

1 ^filter for despreading a received signal in accordance with the present invention that may be 
^employed in the receiver of Figure 1; and 

Figure 4 is a simplified block diagram of yet another alternate embodiment low 
20 ^ower digital filter in accordance with the present invention in which the number of taps is 
ynot a product of two integer numbers. 
Detailed Description 

Figure 1 represents a receiver 10 employing an illustrative embodiment of the 
invention. Receiver 10 may be a handset, a base station, or any receiver employing a 
25 matched digital filter. Receiver 10 receives a radio frequency (RF) signal on antenna 20 and 
down converts the RF signal to a baseband signal by multiplying the received signal by a 
carrier frequency generated by a local oscillator. The down-converted signal is then 
converted from an analog signal to a multi-bit digital signal by an analog-to-digital converter 
24. The digital signal may be filtered by a low pass filter to remove aliasing noise, resulting 
30 in multi-bit received digital signal samples. The received digital signal samples are changed 
if necessary to the form of a multi-bit digital signal having a chipping rate of the spreading 
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code originally used to modulate the digital information of the signal The received digital 
signal samples may additionally include two components, an I channel component 26 and a 
Q channel component 28, The I and Q channel signal components may be processed in a 
number of functions, such as in a rake receiver 30 including a despreader 34 to demodulate 
5 the received digital signal samples, and in a cell search circuit 32 including a despreader 36 
to provide synchronized tracking, as are mentioned below. The correlation information 
produced by the rake receiver and cell search may be further processed such as by processor 
42. 

An illustrative embodiment of a digital matched filter 60 that can be implemented in 
10 software, hardware such as an integrated circuit, or a combination of software or hardware, is 
shown in Figure 2. The integrated circuit may be, for example, be a macrocell in an 
application specific integrated circuit, a microprocessor, a microcontroller, or a digital signal 
^processor. Defining T as the number of taps in a straight-forward implementation, as 
^described above, of a digital filter. The T taps are factored into a product of N and L, where 
15 5f|N x L = T, and N and L are any integers that divide into T. Matched filter 60 is comprised of 
;tiT + N multiple bit delays. The use of the N additional delays ( as compared to the straight- 
H forward implementation) will become clear below. As will be explained below, the 
-invention can accommodate a number of taps that can not be divided into a product of two 
"Integers. 

20 0 An array 62 of multiple bit delays Dn through D N L+i are arranged as an N x L+l 
;3rray, or equivalent, illustratively having N rows and L-fl columns. In the illustrated 
embodiment, N is 4 although the invention is not limited thereto. The bit-width of each delay 
depends on system design, such as the number of bits in the output of analog-to-digital 
converter 24. Each row of delays in array 62 forms a row shift register 64i through 64 N (in 

25 the illustrated embodiment, 64] through 64 4 ) connected as shown. Each row shift register, 
when clocked, shifts or transfers data from the output of each stage to a "downstream" stage 
for storage therein, as is known in the art. The clocking is not illustrated in Figure 2; one 
skilled in the art would understand how to clock the delays. 

Digital data samples, each representing a sample at a sample instant, received on line 

30 68 are clocked into registers of buffer 66 to provide a parallel output from buffer 66 of N 
digital signal samples. Buffer 66 is comprised of N registers Di through D N (in the 
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illustrated embodiment, four registers Di through D4), which in a preferred embodiment are 
of the same, or greater, bit width as the bit width of delays Dn through Dn l+i- The output of 
registers Di through Dn of buffer 66 are coupled as inputs to respective shift registers 64i 
through 64 N as shown in Figure 2. Each of shift registers 64i through 64 N is connected to 
5 receive as an input an output from a corresponding register Di through Dn of buffer 66. In 
one embodiment, the digital data samples held in buffer 66 are not accessible to be a 
multiplier or multiplicand, and while held in buffer 66 do not contribute to an output from 
multiplier 60. 

The output of each multiple bit delay Dn through Dn l+i is coupled as a multiplicand 
10 input to a respective multiplier Mn through Mn l+i (in the illustrated embodiment, Mn 

through M4 l+i) as shown. A tap weight coefficient is provided as the multiplier input to each 
of multipliers Mn through M N l+i- In a preferred embodiment, the tap weight coefficients are 
^binary and may take on the values of a logic zero or a logic one. Since in the preferred 
^embodiment, the tap weight coefficients are single bits that are either a one or a zero, a 
15 , ; ' multiplication operation per se does not occur but rather the tap weight coefficients 

:J determine whether the output from a corresponding multiple bit delay contributes to sum 72 
< #or does not contribute to sum 72 produced by adder 70. 

= ^ In a CDMA application, coefficients C 0 through C T _i are one bit wide tap weights 
i representing the spreading code. However, the invention is not limited to one bit wide tap 
20 ipveights. Furthermore, not all tap weights must be of the same bit width. In non-CDMA 
^^applications, coefficients Co through Ct-i may be of a bit width of greater than one bit, but 
typically be of a bit width less than or equal to the bit width of delays Dn through Dn l+i, or 
less than or equal to the bit width of the digital signal samples. In applications where the tap 
weights are greater than one bit in width, a multiplication may occur to generate the outputs 
25 On through 0 N l+i. 

Each multiplier Mn through Mnl+i produces a corresponding output On through On 
l+i that is the product of the two inputs to each respective multiplier. The outputs On 
through On l+i (in the illustrated embodiment On through O4L+1) from corresponding 
multipliers Mn through Mnl+i are provided as inputs to adder 70. Outputs On through On 
30 l+i are combined by adder 70 to form sum 72, which is the output from adder 70. Sum 72 is 
the correlation of the spreading code represented by coefficients Co through Gr-i with 
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samples of the received signal. The magnitude of the correlation determines whether there is 
meaningful information to be extracted from the received signal samples. In a CDMA 
system, the correlation sum is used inter alia for synchronization, capture, synchronized 
tracking, and data demodulation. The correlation sum may be used for other purposes in 
5 non-CDMA applications. 

Coefficients C 0 through Ct-i may be provided by any technique to achieve the 
inventive digital filter 60. For example, the coefficients Co through Ct-i may be provided to 
tap changer 76 from a tap weight source 74, such as but not limited to random access 
memory (RAM), read only memory (ROM), or a processor. 
10 In operation, digital filter 60 illustrated in Figure 2 receives digital signal samples, for 

example from an analog-to-digital converter 24, on line 68. The digital signal samples are 
clocked at the spreading code clock rate into the registers Di through D N of buffer 66. In a 
1 ^ preferred embodiment, the digital data samples are individually clocked into registers Di 
! through D4 of buffer 66. For example, a digital signal sample is clocked into register D4. 
15 ; = ;The next digital signal sample is clocked into register D 3 . The next digital signal sample is 
^clocked into register D2, and the next digital signal sample is clocked into register Di. Each 
ulregister DI through D4 provides at its output the digital signal sample stored therein. Once 
; ^registers Di through D4 are updated with digital signal samples, the row shift registers are 
j ^clocked to shift the digital signal samples from the output of registers Di through D4 of buffer 
20 r g:66 into respective delays of the corresponding row shift registers 64i, 64 2 , 643, and 644. In 
; " jthis manner, after the row shift registers are clocked L+l times, all of the delays D n through 
Dn l+i are filled with valid digital signal samples. As is known in the art, until all of the 
delays are updated to contain valid digital signal samples, the correlation sum 72 may be 
discarded. Furthermore, while the buffer of N registers or delays, Di through D N) is clocked 
25 at the spreading clock frequency, the row shift registers 64i through 644 are clocked at a 
lower rate that is 1/N times the spreading clock frequency. With this reduction in the 
frequency of clocking the delay stages of the row shift registers there is a concomitant 
reduction in power consumption. 

As the digital signal samples are shifted into buffer 66, thence into row shift registers 
30 64i, 64 2 , 64 3 , and 64 4 , taps at the output of each delay Dn through Dnl+i provide the digital 
signal samples as multiplier inputs to multipliers Mn through M N l+i- Tap weight 
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coefficients, Co through Gr-i, such as from tap weight source 74, are provided to tap changer 
76 thence by tap changer 76 as the multiplicand input to multipliers Mn through Mnui- The 
product outputs On through On l +i produced by the multipliers are summed in adder 70 to 
produce the correlation sum 72 for one sample instant. 
5 In accordance with the invention, the coefficients C 0 through C T -i are shifted by tap 

changer 76 in coordination with the digital signal samples being shifted from buffer 66 
through registers Dn through Dnl+i so that the appropriate product terms (outputs On 
through On l+i ) are generated to contribute to a correlation sum at each sample instant. The 
tap changer shifts the coefficients Co through Gr-i at the spreading clock frequency. The row 
10 shift registers 64i, 642, 643, and 64 4 shift the digital data samples at a clock rate that is IfN 
times the spreading clock frequency. Since the coefficients in a preferred embodiment are 
one bit wide, the power required to shift the coefficients is relatively small by comparison to 
Shifting all of the digital signal samples at the spreading clock frequency. Even when the 
^coefficients have a bit width greater than one bit, there will be a power savings. For 
15 ftfexample, in a CDMA system, the tap weight coefficients may be one bit wide while the 
^digital data samples as well as the registers of the row shift registers are eight bits wide. 
w In order to achieve the same correlation sum as the correlation sum obtained in the 
i;; Straight-forward filter technique, the number of delay stages is increased to T + N from T. 
■ : ^The number of taps, multipliers, and inputs to adder 70 are also increased by N as compared 
20 \Mo the straight-forward filter technique. The tap weight coefficients, Co through Cj-i, are 
provided to and shifted into tap changer 76 from, for example, tap weight source 74. The 
spreading code tap weight coefficients are augmented with leading and trailing zeroes as 
illustrated in coefficient shift position 1 of Table 1 to form the augmented coefficients that 
are provided as the multiplicand input to multipliers Mn through Mnl*i. Although Table 1 
25 is prepared for the case N = 4, one skilled in the art can determine the tap weight coefficient 
shift pattern, and the multiplier associated with each tap weight coefficient in the more 
general case. 
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Coefficient 
Shift 
Position 

COEFFICIENTS 

1 

0 

0 

0 


c, 

c 2 


C-r-4 

Ct-3 


On 

0 

2 

0 

0 

Co 

Ci 

c 2 

C 


C T -? 

Ct-2 

C t .i 

0 

0 

3 

0 

Co 

c, 

c 2 

c 3 

c 4 


Qr-2 

Ct-i 

0 

0 

0 

4 

C„ 

Ci 

c 2 

c 2 

c ? 

C, 


C T -i 

0 

0 

0 

0 

Multi-plier 

M„ 

M M 

Mji 

M 41 

M 12 

M 22 


M 4L 

M 1L+1 

M 2L+l 

M 3L fi 

M 4L n 


TABLE 1 


5 Once a multiplication utilizing all of the coefficients in coefficient shift position 1 of Table 1 
takes place in multipliers Mn through Mnl+u in addition to the product outputs On through 
On l+i being provided as inputs to adder 70, tap changer 76 is clocked to rotate the 
augmented coefficients. Tap changer 76 may include a circular shift register or 
y ^combinational logic such that the augmented coefficients rotate through positions when tap 
10 ; ^ changer 76 is clocked. Once tap weights are loaded into tap changer 76, the tap weights 
^contribute to a correlation sum from the initial coefficient shift position, and tap changer 76 
^ is clocked N-l times to rotate the tap weights before the augmented coefficients are loaded 
; j !from tap weight source 74 into tap changer 76 again, thereby being reset to the stored 
coefficients in coefficient shift position 1. 
15 I'X The effect on the augmented coefficients of tap changer 76 being clocked can be seen 
■ j;in the rows of Table 1. Row 1 represents the augmented coefficients as retrieved from 
rjmemory. Upon clocking tap changer 76, each tap weight in row 1 of Table 1 representing 
coefficient shift position 1 is circularly shifted to the left. The tap weight in the left-most 
column associated with multiplier Mn is shifted into the right-most column, to be associated 
20 with multiplier M4 l+i in coefficient shift position 2 of Table 1 . Row 2 represents the 

augmented coefficients after tap changer 76 is clocked once. Row 3 represents the augmented 
coefficients after tap changer 76 is again clocked. Row 4 represents the augmented 
coefficients after tap changer 76 is again clocked. The augmented coefficients are shifted N - 
1 times in tap changer 76, for a total number of N positions. In each position, the augmented 
25 coefficients are provided as respective multiplicands to multipliers Mn through Mn l+i in the 
correlation process. The shift pattern represented in Table 1 is coordinated with the sequence 
of clocking digital signal samples into buffer 66 and the physical arrangement of multipliers 
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Mn through Mnl+i to assure the appropriate intermediate product terms, On through Onl+u 
are generated to produce the correlation sum 72 at each sample instant. 

An alternate illustrative embodiment digital matched filter 360 is shown in Figure 3 
that can be employed in a receiver and implemented in software, hardware, or some 
5 combination thereof. Elements in the embodiment illustrated in Figure 3 having reference 
numerals similar to reference numerals of elements in the embodiment illustrated in Figure 2 
have a similar function. Buffer 366 is a serial input, parallel output buffer. In the alternate 
embodiment digital matched filter 360, digital signal samples are clocked into register Di, 
and shifted through the registers of buffer 366 until each of the N buffer registers has been 
10 updated. With all of the registers Di through D N of buffer 366 updated with digital signal 
samples, the row shift registers 364i, 364 2 , 364 3 , and 364 4 are clocked to shift the digital 
signal samples from registers Di through Dn of buffer 366 into respective delays of the 
, ^corresponding row shift registers 364i, 364 2 , 364 3 , and 3644. In this manner, after the row 
Sshift registers are clocked L+l times, all of the delays Dn through Dnl+i in shift registers 
15 " II 364i, 364 2 , 364 3 , and 364 4 are filled with valid digital signal samples. Furthermore, while 
jjeach of the N registers of buffer 366, Di through Dn, is clocked at the spreading clock 
^frequency, the row shift registers 364i through 364 4 are clocked at a lower rate that is 1/N 
uptimes the spreading clock frequency. Filter 360 has much of the power consumption 
I ^reduction of filter 60. In filter 360, there are N - 1 more registers of buffer 366 clocked at 
20 v -|ihe spreading clock frequency than in filter 60. With this reduction in the frequency of 

^clocking the delay stages of the row shift registers there is a concomitant reduction in power 
consumption. 

While the illustrative embodiments of the invention have been described with respect 
to a CDMA communication system, the invention is not limited to being used in CDMA 
25 systems. The invention may be employed in any filter application, and may be fabricated in 
an integrated circuit using any known technology. 

While the illustrative embodiment of the invention has been described as filling the 
delay buffer starting with delay D4, and ending with delay DI, the invention is not limited 
thereto. Other sequences of filling the buffer are within the scope of the invention. 
30 Coordination between the sequence of filling registers of the buffer, augmenting the tap 
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weights with leading and trailing zeroes, and circular shifting of the augmented tap weights is 
necessary to produce the appropriate products to sum. 

The invention has application where the bit width of tap weight coefficients is less 
than the bit width of the digital signal samples. Since the coefficients in a preferred 
5 embodiment are one bit wide, the power required to shift the coefficients is relatively small 
by comparison to shifting all of the digital signal samples at the spreading clock frequency. 

It was stated above that the T taps are factored into a product of N and L, where N x 
L = T, and N and L are any integers that divide into T, matched filter 60 is comprised of T + 
N multiple bit delays, and that the invention can accommodate a number of taps that can not 
10 be divided into a product of two integers. As can be seen from the right column of Table 1, in 
the illustrated embodiment, the tap weight of multiplier M4L+1 is always zero. Thus, there is 
no contribution to the sum from multiplier M4L+1, the multiplier can be eliminated, delay D4 
^1 can be eliminated, and there is no need to provide output O4U.1 to adder 70. This departs 
: Ifrom an array 62 of delays that is N x L+L The number of delay stages in array 62 can be 
15 1 1 [factored into NxL+1, with a remainder of R, where R < N. The R delays should be included 
jin the shift registers such that there is no sample instant gap between the data samples in the 
; i}shift registers. Where the R delays are in the array is dependent in part on the sequence of 
^ filling the buffer registers. In Figures 2 and 3, for example, if R registers were present, they 
I " would form the rightmost column of the array, and would fill in delay stages beginning from 
20 rpthe top of the array as illustrated. An example of a digital filter in which the number of taps 
V;|s not a product of two integer numbers is illustrated in Figure 4. 
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